DOCID: 3860926 



UNCLASSIFIED 



Cryptologic Quarterly 



(U) A History of Secure Voice Coding: 

Insights Drawn from the Career of One of the Earliest 
Practitioners of the Art of Speech Coding 



JOSEPH P. CAMPBELL, JR., and RICHARD A. DEAN 



Editor's Note: This article Is based on one published In Digital Signal Processing, July 1993, with permission ofthe 

authors. 



The history of speech coding is closely tied to 
the career of Tom Tremain. He joined the 
National Security Agency in 1959 as an Air Force 
lieutenant assigned to duty at the Agency. Little 
did he know then that this assignment would 
shape his career as well as the future of speech 
coding. ‘ 

Thomas E. Tremain was the U.S. govern- 
ment’s senior speech scientist. He was a recog- 
nized leader and an expert in speech science. 
Tom’s work spanned five decades of state-of-the- 
art modem and speech coding innovations that 
are the basis of virtually every U.S. and NATO 
modem and speech coding standard. His efforts 
have been critical to U.S. arid NATO tactical and 
strategic secure communications programs. 

I Editor's Note: Since the original article was published, 

■ Mr. Tremain passed away on October 5, 1995.] 

‘ l 

NSA inherited the responsibilities, traditions, 
arid expertise of voice coding for enciphered 
speech applications from the Army Security 
Agency in 1952. The engineers and technicians 
who had participated in the famous SIGSALY 
vocoder used by Roosevelt and Churchill for plan- 
ning the “D-Day” invasion [1, 2] were still devel- 
oping their craft at NSA. SIGSALY, shown in 
Figure 1, was a vocoder-based system related to 
the ‘Talking Machine” firstl introduced by Homer 
Dudley of Bell Labs at the 1939 World’s Fair. 
Developed with Bell Labs, it consisted of a bank of 

j * 

ten bandpass filters spaced approximately at the 
bands of equal articulation for speech, from base- 
band up to 3000 Hz. Each|filter could be excited 
by one of six logarithmically spaced amplitudes 
developed by Hany Nyquist in the first applica- 



tion of PCM. A “Buzz”/“Hiss” generator was used 
as an exciter for the vocoder corresponding to the 
voiced/unvoiced attribute of each 20-ms speech 
segment. Balance of the “Buzz”/“Hiss” generator, 
or voicing, represented a major factor in the qual- 
ity of the speech. Early practitioners of speech 
coders, like Tom, can still be found today speak- 
ing “Aaahhh”/“Sshhhhh” into voice coders to test 
this balance. 

From the time of SIGSALY until Tom arrived 
at NSA, several generations of voice coders had 
been developed in conjunction with Bell Labs. 
The KO-6 voice coder, developed in 1949 and 
deployed in limited quantity, was a close approx- 
imation to the 1200-bps SIGSALY voice coder. 
This was followed in 1953 by the 1650-bps KY-9 
shown in Figure 2. Using a 12-channel vocoder 
and hand-made transistors, it was one of the ear- 
liest applications of solid-state technology. This 
reduced the weight of SIGSALY’s vacuum tube 
technology from 55 tons to a mere 565 pounds! In 
1961, Tom’s first project was the development of 
the HY-2 vocoder, the last generation of U.S. 
channel vocoder technology. The HY-2, shown in 
Figure 3, was a 16-channel 2400-bps system 
using “Flyball” color-coded modular logic to 
reduce the weight to 100 pounds. Between 1962 
and 1964, Tom created the first simulation of a 
channel formant vocoder in a mainframe digital 
computer and, between 1966 and 1968, he helped 
develop the first digital channel vocoder. 

Even the best of U.S. vocoder technology at 
that time was limited by the analog technology 
that was the basis of its implementation. As the 
analog filters and amplifiers vary with age and 
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J Fig. 1. A view of SIGSALY through the rear ioors of a tractor trailer. 
! The first real-time secure voice system, circa 1944 



i \ 

temperature, so does the sensitive tracking Coding (LPC) generation of voice coders. Tom’s 

required between the speech analyzer and speech predecessors and contemporaries never quite 

synthesizer. Performance in the field never understood how the deck of punched cards he 

approached laboratory performance; users, start- carried down the hall to a Honeywell computer 

ing with Churchill and Roosevelt [2], were reluc- was ever going to result in a vocoder. Tom devel- 

tant to use systems ^hat had a synthetic “Donald oped the subtle techniques necessary to load and 

Duck” quality. President Johnson refused to use invert a matrix in real time in the fixed-point 

the HY-2 because of its poor quality and, as a arithmetic necessary for such an operation. A for- 

result, deployment was li mited. ; ✓ ' mer director of research at NSA ridiculed Tom for 

suggesting such an outlandish proposal; that very 
Tom’s most significant contribution to voice method is common practice today. So that sys- 

coding was in seeing the possibilities of digital terns could run faster on the computers of the 

signal processing for voice. In an era when the day, he developed new constructs, such as the 

state of the art was analog tuned circuits, Tom Average Magnitude Difference Function (AMDF) 

imagined a change to computer-based processing as a replacement for the multiply-intensive auto- 

of speech - a radical shift in thinking. Tom pio- correlation function. Finally, in 1974 Tom 

neered this new approach, again collaborating demonstrated the first real-time computer simu- 

with Bell Labs, to develop the Linear Predictive lation of LPC-10 on the CSP-30 computer, a mile- 

rl 
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i Fig. 2. 
The KY-9 
transistorized 
secure voice 
system, 
circa 1955 





Tom Tremain at work on ajprototype 1960s era Channel 
Vocoder 
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stone in signal processing history. This led 
to a whole new family of NSA vocoder 
products, the STU products, built around 
the first generation of AMD2901/TRW 
multiplier-based high-speed bit-slice sig- 
nal processors that forever changed the 
way voice coding was accomplished. 
Today’s STU-III, shown in Figure 4, is the 
third-generation desktop telephone that 
uses an enhanced LPC-10 and supports 
secure voice users throughout the govern- 
ment. Voice coders, long associated only 
with exotic encryption schemes, are find- 
ing numerous applications today for wire- 
less communications, voice mail, and syn- 
thetic voice applications. Today’s voice 
coders used in satellite 'communications 
and. hand-held digital cellular telephones 
are direct successors of Tom’s early work. 



Page 99 






DOCID: 3860926 



Cryptologic Quarterly 



UNCLASSIFIED 




Fig. 3. The HY-2 channel vocoder, circa 1961 
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{ Fig. 4. The STV-III secure voice terminal family, circa 1986 
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Tom was the chairman of the U.S. govern- 
ment’s Digital Voice Processing Consortium for 
; over twenty years. He reported directly to 
. the under secretary of defense for command, con- 
I trol, communications, and intelligence. The 
^ Consortium is the recognized U.S. government 
j forum for voice algorithm developments. It pro- 
i motes innovative research, removes duplication, 
evaluates new algorithms, and promotes technol- 
j ogy insertion. Under Tom’s leadership, the 
■ Consortium became a prestigious U.S. forum for 
advancing state-of-the-art speech technology 
j research. The Consortium activities and Tom’s 
; personal designs have had a far reaching impact 
] on operational and planned communications sys- 
j terns totaling billions of dollars of national assets. 

' His voice compression work facilitated high-qual- 
ity secure communications worldwide. Senior 
I U.S. government officials, national communica- 
tions leaders, and other senior scientists ffe- 
, quently sought his opinions. He instituted speech 
! testing comparison standards, promoted the 
establishment of a world-recognized independent 
, test facility, initiated innovative voice compres- 
I'sion development programs, and started and 
monitored voice research programs at universi- 
ties. His original work bn continuously variable 
! slope delta modulation,! linear predictive coders, 
I vocoders, adaptive predictive coders, code excited 
| linear predictors, modem technology, channel 
{ 'simulation, and intelligibility' and quality test 
! methodology forms the basis of most voice com- 
.munications used today. For these efforts, the 
secretary of defense awarded Tom with U.S. DoD 
, .Meritorious Civilian Service Medals in 1985 and 
"1992. J 

I 

, Tom chaired many of the IEEE ICASSP and 
> Speech Tech sessions and was a primary organiz- 
, er of early ICASSPs and Speech Coding 
Workshops. He personally disseminated 
throughout the world the LPC source code that 
enabled much of today’s research. He was solely 
responsible for LPC being the ubiquitous low-rate 
speech coder of its day.; He published numerous 
extensively referenced papers. One of his papers 



[3] on LPC is perhaps the most widely referenced 
paper in speech coding. 

Tom’s later achievements were as numerous 
and important as his long-term successes. His 
team’s Code Excited Linear Prediction (CELP) 
speech coder, used in the STU-III, was endorsed 
as Federal Standard 1016 and proposed for a 




Tom Tremain Cleft) was recognized by the IEEE In 
1993'with a special award "Forhis major cpntril?u-; • ' 
tions and sustained leadership in digital speech cod- 1 



ing, modem technology, and U.S. and NATO stan- 
dards for speech coding and communication-' 14] 

NATO standard. A modified form of CELP is the 
basis of the algorithm used in North American 
digital cellular systems. 

Tom was also the chairman of the NATO 
Narrowband Speech Working Group where he 
introduced low bit rate speech coders for HF 
ECCM and VHF/UHF ECCM applications. He led 
research in low bit rate speech coding designs by 
developing vector quantization approaches using 
split codebooks and tree-based algorithms for 
fast search of large codebooks. These techniques 
achieve equivalent 2400-bps‘speech intelligibility 
at only a 600-bps rate. 
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In looking back on Tom’s career, one might 
point to the voice standards that he set or the- 
organizations that he chaired as a measure of his 
overall contribution. Tom would more likely have 
pointed not to his own research accomplish- 
ments, but rather to the new generation of speech 
researchers that he mentored who still carry on 
his work. 



For further information regarding the subject 
of this article please contact the authors at the fol- 
lowing email addresses: j.campbell@ieee.org and 
rdeanl@home.com 




• The authors and their colleagues. Photo was taken in front of "SIG5ALY" exhibit at its October 13, 2000, unveiling 
| ceremony at the Nationalj Cryptologic Museum. Those pictured span four generations of voice coding and Infor- 
j matfon security researchers. From left: Mary 'Tina" A. Kohler, who worked on code excited linear predictive coding 
i CCELP) and led the evaluation ofthe current generation of MELP vocoders; Joseph P. Campbell, Jr., who worked on 
; LPC and led the CELP team; Mitchell R, Brown, who dismantled SIGSALYs and developed analog vocoders; Mahlon 
j E. Doyle, who operated a SIG5ALY and became a leading cryptographic designer; and Richard A. Dean, who worked 
. on digital vocoders (including LPC). 
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